Visual Speech Recognition with Lightweight Psychologically Motivated Gabor Features
نویسندگان
چکیده
منابع مشابه
Articulatory motivated acoustic features for speech recognition
In this paper, we consider the use of multiple acoustic features of the speech signal for continuous speech recognition. A novel articulatory motivated acoustic feature is introduced, namely the spectrum derivative feature. The new feature is tested in combination with the standard Mel Frequency Cepstral Coefficients (MFCC) and the voicedness features. Linear Discriminant Analysis is applied to...
متن کاملPsychologically Motivated Text Mining
Natural language processing techniques are increasingly applied to identify social trends and predict behavior based on large text collections. Existing methods typically rely on surface lexical and syntactic information. Yet, research in psychology shows that patterns of human conceptualisation, such as metaphorical framing, are reliable predictors of human expectations and decisions. In this ...
متن کاملObtaining Psychologically Motivated Spaces with MDS
The main purpose with this paper is to describe how a psychologically motivated conceptual space can be obtained with MDS (multidimensional scaling) and how it can be expressed in terms of a more primitive “physical” (or mathematical) one. The idea is demonstrated practically with the aid of two experimental pilot studies. The paper is concluded by a critical discussion of the method used.
متن کاملVoiceless Speech Recognition Using Dynamic Visual Speech Features
This paper describes a voiceless speech recognition technique that utilizes dynamic visual features to represent the facial movements during phonation. The dynamic features extracted from the mouth video are used to classify utterances without using the acoustic data. The audio signals of consonants are more confusing than vowels and the facial movements involved in pronunciation of consonants ...
متن کاملAudio-Visual Speech Recognition Using MPEG-4 Compliant Visual Features
We describe an audio-visual automatic continuous speech recognition system, which significantly improves speech recognition performance over a wide range of acoustic noise levels, as well as under clean audio conditions. The system utilizes facial animation parameters (FAPs) supported by the MPEG-4 standard for the visual representation of speech. We also describe a robust and automatic algorit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Entropy
سال: 2020
ISSN: 1099-4300
DOI: 10.3390/e22121367